Graph Sparsification via Refinement Sampling
نویسندگان
چکیده
A graph G(V,E) is an ǫ-sparsification of G for some ǫ > 0, if every (weighted) cut in G is within (1 ± ǫ) of the corresponding cut in G. A celebrated result of Benczúr and Karger shows that for every undirected graph G, an ǫ-sparsification with O(n log n/ǫ) edges can be constructed in O(m log n) time. The notion of cut-preserving graph sparsification has played an important role in speeding up algorithms for several fundamental network design and routing problems. Applications to modern massive data sets often constrain algorithms to use computation models that restrict random access to the input. The semistreaming model, in which the algorithm is constrained to use Õ(n) space, has been shown to be a good abstraction for analyzing graph algorithms in applications to large data sets. Recently, a semi-streaming algorithm for graph sparsification was presented by Anh and Guha; the total running time of their implementation is Ω(mn), too large for applications where both space and time are important. In this paper, we introduce a new technique for graph sparsification, namely refinement sampling, that gives an Õ(m) time semi-streaming algorithm for graph sparsification. Specifically, we show that refinement sampling can be used to design a one-pass streaming algorithm for sparsification that takes O(log logn) time per edge, uses O(log n) space per node, and outputs an ǫ-sparsifier with O(n log n/ǫ) edges. At a slightly increased space and time complexity, we can reduce the sparsifier size to O(n logn/ǫ) edges matching the Benczúr-Karger result, while improving upon the Benczúr-Karger runtime for m = ω(n log n). Finally, we show that an ǫ-sparsifier with O(n log n/ǫ) edges can be constructed in two passes over the data and O(m) time whenever m = Ω(n) for some constant δ > 0. As a by-product of our approach, we also obtain an O(m log logn + n logn) time streaming algorithm to compute a sparse k-connectivity certificate of a graph. Departments of Management Science and Engineering and (by courtesy) Computer Science, Stanford University. Email: [email protected]. Research supported in part by NSF award IIS-0904325. Institute for Computational and Mathematical Engineering, Stanford University. Email: [email protected]. Research supported by a Stanford Graduate Fellowship. Department of Computer and Information Science, University of Pennsylvania, Philadelphia PA. Email: [email protected]. Supported in part by NSF Awards CCF-0635084 and IIS-0904314.
منابع مشابه
Efficient Sampling for Gaussian Graphical Models via Spectral Sparsification
Motivated by a sampling problem basic to computational statistical inference, we develop a toolset based on spectral sparsification for a family of fundamental problems involving Gaussian sampling, matrix functionals, and reversible Markov chains. Drawing on the connection between Gaussian graphical models and the recent breakthroughs in spectral graph theory, we give the first nearly linear ti...
متن کاملDrawing Big Graphs Using Spectral Sparsification
Spectral sparsification is a general technique developed by Spielman et al. to reduce the number of edges in a graph while retaining its structural properties. We investigate the use of spectral sparsification to produce good visual representations of big graphs. We evaluate spectral sparsification approaches on real-world and synthetic graphs. We show that spectral sparsifiers are more effecti...
متن کاملSpectral Sparsification and Restricted Invertibility
Spectral Sparsification and Restricted Invertibility Nikhil Srivastava 2010 In this thesis we prove the following two basic statements in linear algebra. Let B be an arbitrary n×m matrix where m ≥ n and suppose 0 < ε < 1 is given. 1. Spectral Sparsification. There is a nonnegative diagonal matrix Sm×m with at most dn/ε2e nonzero entries for which (1− ε)2BBT BSBT (1 + ε)2BBT . Thus the spectral ...
متن کاملGraph Sparsification Approaches for Laplacian Smoothing
Given a statistical estimation problem where regularization is performed according to the structure of a large, dense graph G, we consider fitting the statistical estimate using a sparsified surrogate graph G̃, which shares the vertices of G but has far fewer edges, and is thus more tractable to work with computationally. We examine three types of sparsification: spectral sparsification, which c...
متن کاملLecture 04 / 13 : Graph Sampling and Sparsification
In previous lectures, we discussed various topics on graph: sparsest cut, clustering which finds small k-cuts, max flow, multicommodity flow, etc. For very large graphs, we’d like to ask “instead of regular approximation algorithms which run in polynomial time, can we find any near-linear time algorithm for minimum cut or maximum flow problems?”. A simple idea is for a given graph G, to solve t...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره abs/1004.4915 شماره
صفحات -
تاریخ انتشار 2010